Exploration-Free Policies in Dynamic Pricing and Online Decision-Making

نویسنده

  • Mohsen Bayati
چکیده

Growing availability of data has enabled practitioners to tailor decisions at the individuallevel. This involves learning a model of decision outcomes conditional on individual-specific covariates or features. Recently, contextual bandits have been introduced as a framework to study these online and sequential decision making problems. This literature predominantly focuses on algorithms that balance an exploration-exploitation tradeoff, since greedy policies that exploit current estimates without any exploration may be sub-optimal in general. However, exploration-free greedy policies are desirable in many practical settings where experimentation may be prohibitively costly or unethical.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A DSS-Based Dynamic Programming for Finding Optimal Markets Using Neural Networks and Pricing

One of the substantial challenges in marketing efforts is determining optimal markets, specifically in market segmentation. The problem is more controversial in electronic commerce and electronic marketing. Consumer behaviour is influenced by different factors and thus varies in different time periods. These dynamic impacts lead to the uncertain behaviour of consumers and therefore harden the t...

متن کامل

The Interrelationship between Quality Costs and Pricing Decision-Making: An Exploratory Study on a Sample of Industrial Companies

There is a causal relationship between high-quality cost systems and pricing decision makers because pricing decision is in dire need of modern systems that help make rational decisions. The aim of this research is to confirm that quality cost systems affect pricing decisions-making in maintaining the industrial companies. The research results can be utilized by beneficiaries taking into accoun...

متن کامل

Modelling and Decision-making on Deteriorating Production Systems using Stochastic Dynamic Programming Approach

This study aimed at presenting a method for formulating optimal production, repair and replacement policies. The system was based on the production rate of defective parts and machine repairs and then was set up to optimize maintenance activities and related costs. The machine is either repaired or replaced. The machine is changed completely in the replacement process, but the productio...

متن کامل

CCollaborative Framework for Decision Making Process of the Water Management (Case Study: Kashafrood Basin)

Sophisticated social- ecology systems, such as those in water management in a basin, are usually dynamic, multidimensional, or multidimensional, requiring serious engagement by multiple actors, and decision making in such systems is always faced with serious problems Kashafrood basin in Khorasan Razavi province is one of the most critical aquifers in the country due to the high population growt...

متن کامل

Engineering an Agent-based System for Product Pricing Automation

This paper describes an autonomous agent conceived for automating the decision making process for pricing products. Product pricing involves the interaction of decision makers with different possibly conflicting points of view. Our approach allows for applying individual pricing policies to each product by taking into account different points of view expressed through different arguments and th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016